Fast, Scalable, and Accurate Algorithms for Time-Series Analysis
Time is a critical element for the understanding of natural processes (e.g., earthquakes and weather) or human-made artifacts (e.g., stock market and speech signals). The analysis of time series, the result of sequentially collecting observations of such processes and artifacts, is becoming increasingly prevalent across scientific and industrial applications. The extraction of non-trivial features (e.g., patterns, correlations, and trends) in time series is a critical step for devising effective time-series mining methods for real-world problems and has been the subject of active research for decades. In this dissertation, we address this fundamental problem by studying and presenting computational methods for efficient unsupervised learning of robust feature representations from time series. Our objective is to (i) simplify and unify the design of scalable and accurate time-series mining algorithms; and (ii) provide a set of readily available tools for effective time-series analysis. We focus on applications operating solely over time-series collections and on applications where the analysis of time series complements the analysis of other types of data, such as text and graphs.
For applications operating solely over time-series collections, we propose a generic computational framework, GRAIL, to learn low-dimensional representations that natively preserve the invariances offered by a given time-series comparison method. GRAIL represents a departure from classic approaches in the time-series literature, where representation methods are agnostic to the similarity function used in subsequent learning processes. GRAIL relies on the attractive idea that once we construct the data-to-data similarity matrix, most time-series mining tasks can be trivially solved. To overcome scalability issues associated with approaches relying on such matrices, GRAIL exploits time-series clustering to construct a small set of landmark time series and learns representations to reduce the data-to-data matrix to a data-to-landmark matrix. To demonstrate the effectiveness of GRAIL, we first present domain-independent, highly accurate, and scalable time-series clustering methods to facilitate exploration and summarization of time-series collections. Then, we show that GRAIL representations, when combined with suitable methods, significantly outperform, in terms of efficiency and accuracy, state-of-the-art methods in major time-series mining tasks, such as querying, clustering, classification, sampling, and visualization. Overall, GRAIL emerges as a new primitive for highly accurate, yet scalable, time-series analysis.
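The landmark idea above can be sketched in a few lines. The following is an illustrative toy, not GRAIL's actual algorithm: GRAIL learns representations that preserve a chosen time-series comparison method, whereas here k-means centers serve as landmarks and an RBF kernel stands in for the similarity function, both assumptions for the sake of a runnable example.

```python
import numpy as np
from sklearn.cluster import KMeans

def landmark_representation(X, k=8, gamma=0.1, seed=0):
    """Represent each series by its similarities to k landmark series,
    reducing the (n, n) data-to-data matrix to an (n, k) matrix."""
    km = KMeans(n_clusters=k, n_init=10, random_state=seed).fit(X)
    landmarks = km.cluster_centers_                 # (k, T) landmark series
    # Squared Euclidean distance from every series to every landmark
    d2 = ((X[:, None, :] - landmarks[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)                      # RBF similarity (stand-in)

# Usage: 200 noisy sine/cosine series of length 50
rng = np.random.default_rng(0)
t = np.linspace(0, 2 * np.pi, 50)
X = np.vstack([np.sin(t) + 0.1 * rng.standard_normal((100, 50)),
               np.cos(t) + 0.1 * rng.standard_normal((100, 50))])
Z = landmark_representation(X)
print(Z.shape)  # compact (200, 8) features for querying, clustering, etc.
```

The compact matrix `Z` can then feed any downstream learner, which is the sense in which such representations act as a primitive for subsequent mining tasks.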
For applications where the analysis of time series complements the analysis of other types of data, such as text and graphs, we propose generic, simple, and lightweight methodologies to learn features from time-varying measurements. Such applications often organize operations over different types of data in a pipeline such that one operation provides input---in the form of feature vectors---to subsequent operations. To reason about the temporal patterns and trends in the underlying features, we need to (i) track the evolution of features over different time periods; and (ii) transform these time-varying features into actionable knowledge (e.g., forecasting an outcome). To address this challenging problem, we propose principled approaches to model time-varying features and study two large-scale, real-world applications. Specifically, we first study the problem of predicting the impact of scientific concepts through temporal analysis of characteristics extracted from the metadata and full text of scientific articles. Then, we explore the promise of harnessing temporal patterns in behavioral signals extracted from web search engine logs for early detection of devastating diseases. In both applications, combining features with time-series-relevant features yielded greater impact than any other indicator considered in our analysis. We believe that our simple methodology, along with the interesting domain-specific findings that our work revealed, will motivate new studies across different scientific and industrial settings.
Quantitative Analysis for Authentication of Low-cost RFID Tags
Formal analysis techniques are widely used today to verify and analyze communication protocols. In this work, we launch a quantitative verification analysis for the low-cost Radio Frequency Identification (RFID) protocol proposed by Song and Mitchell. The analysis exploits a Discrete-Time Markov Chain (DTMC) using the well-known PRISM model checker. We have managed to represent up to 100 RFID tags communicating with a reader and to quantify each RFID session according to the protocol's computation and transmission cost requirements. As a consequence, not only does the proposed analysis provide quantitative verification results, but it also constitutes a methodology for RFID designers who want to validate their products under specific cost requirements.
Comment: To appear in the 36th IEEE Conference on Local Computer Networks (LCN 2011).
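The kind of cost quantification a DTMC enables can be illustrated with a toy absorbing chain. This is a hypothetical three-state session model, not the Song-Mitchell protocol or a PRISM model: each message is lost with probability p and retried, and the expected number of transmissions follows from the fundamental matrix of the chain.

```python
import numpy as np

# Toy DTMC for one authentication session: 0=query, 1=reply, 2=done.
# With probability p a message is lost and the step is retried.
p = 0.1
P = np.array([[p,   1 - p, 0.0  ],   # query: lost -> retry, else reply
              [0.0, p,     1 - p],   # reply: lost -> retry, else done
              [0.0, 0.0,   1.0  ]])  # done is absorbing

# Sanity check: rows of a DTMC transition matrix sum to 1
assert np.allclose(P.sum(axis=1), 1.0)

# Expected transmissions before absorption, via the fundamental
# matrix N = (I - Q)^-1 over the transient states {query, reply}.
Q = P[:2, :2]
N = np.linalg.inv(np.eye(2) - Q)
expected_steps = N.sum(axis=1)[0]    # starting from "query"
print(round(expected_steps, 3))      # 2/(1-p) = 2.222 transmissions
```

A model checker like PRISM computes such expectations (and probabilistic reachability properties) symbolically over much larger state spaces, which is what makes quantifying 100 concurrent tag sessions feasible.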
Advanced Search, Visualization and Tagging of Sensor Metadata
As sensors continue to proliferate, the capability of effectively querying not only sensor data but also its metadata becomes important in a wide range of applications. This paper demonstrates a search system that utilizes various techniques and tools for querying sensor metadata and visualizing the results. Our system provides an easy-to-use query interface, built upon semantic technologies, where users can freely store and query their metadata. Going beyond basic keyword search, the system provides a variety of advanced functionalities tailored for sensor metadata search: ordering search results according to our ranking mechanism based on the PageRank algorithm, recommending pages that contain metadata information relevant to given search conditions, presenting search results using various visualization tools, and offering dynamic hypergraphs and tag clouds of metadata. The system has been running as a real application and its effectiveness has been demonstrated by a number of users.
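The PageRank-based ranking mentioned above can be sketched generically. The graph below is a made-up example; how the paper's system actually builds its link graph over metadata pages is an assumption not stated here.

```python
import numpy as np

def pagerank(adj, d=0.85, tol=1e-10):
    """Power-iteration PageRank over an adjacency matrix adj,
    where adj[i, j] = 1 means page i links to page j."""
    n = adj.shape[0]
    out = adj.sum(axis=1, keepdims=True)
    # Row-normalize; dangling pages link uniformly to everyone
    M = np.where(out > 0, adj / np.maximum(out, 1), 1.0 / n)
    r = np.full(n, 1.0 / n)
    while True:
        r_next = (1 - d) / n + d * (M.T @ r)
        if np.abs(r_next - r).sum() < tol:
            return r_next
        r = r_next

# Tiny graph of metadata pages: pages 0 and 2 both link to page 1
adj = np.array([[0, 1, 0],
                [0, 0, 1],
                [0, 1, 0]], dtype=float)
r = pagerank(adj)
print(r.argmax())  # page 1, with two in-links, ranks highest
```

Ranking metadata pages by such scores orders search results by link-structure importance rather than by keyword match alone.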
A computational framework for complex disease stratification from multiple large-scale datasets.
BACKGROUND: Multilevel data integration is becoming a major area of research in systems biology. Within this area, multi-'omics datasets on complex diseases are becoming more readily available and there is a need to set standards and good practices for integrated analysis of biological, clinical and environmental data. We present a framework to plan and generate single and multi-'omics signatures of disease states. METHODS: The framework is divided into four major steps: dataset subsetting, feature filtering, 'omics-based clustering and biomarker identification. RESULTS: We illustrate the usefulness of this framework by identifying potential patient clusters based on integrated multi-'omics signatures in a publicly available ovarian cystadenocarcinoma dataset. The analysis generated a higher number of stable and clinically relevant clusters than previously reported, and enabled the generation of predictive models of patient outcomes. CONCLUSIONS: This framework will help health researchers plan and perform multi-'omics big data analyses to generate hypotheses and make sense of their rich, diverse and ever-growing datasets, to enable implementation of translational P4 medicine.
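The middle two steps (feature filtering and 'omics-based clustering) can be sketched on synthetic data. This is a minimal stand-in: the variance filter, z-scoring, feature-concatenation integration, and k-means are common simple choices assumed here, not the framework's own methods, and the "expression"/"methylation" layers are fabricated for illustration.

```python
import numpy as np
from sklearn.cluster import KMeans

def filter_features(X, top=50):
    """Step 2 stand-in: keep the most variable features per layer."""
    idx = np.argsort(X.var(axis=0))[::-1][:top]
    return X[:, idx]

def zscore(X):
    return (X - X.mean(0)) / (X.std(0) + 1e-12)

# Two synthetic 'omics layers for 60 patients in two disease states
rng = np.random.default_rng(1)
labels = np.repeat([0, 1], 30)
expr = rng.standard_normal((60, 200)) + labels[:, None] * 2.0
meth = rng.standard_normal((60, 300)) - labels[:, None] * 2.0

# Step 3 stand-in: integrate filtered layers, then cluster patients
Z = np.hstack([zscore(filter_features(expr)), zscore(filter_features(meth))])
clusters = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(Z)
print(len(set(clusters)))  # 2 patient clusters from integrated signatures
```

The resulting cluster assignments would then feed step 4, biomarker identification, e.g. by testing which features separate the clusters.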
Effective Metadata Management in Federated Sensor Networks
As sensor networks become increasingly popular, heterogeneous sensor networks are being interconnected into federated sensor networks and provide huge volumes of sensor data to large user communities for a variety of applications. Effective metadata management plays a crucial role in processing and properly interpreting raw sensor measurement data, and needs to be performed in a collaborative fashion. Previous data management work has concentrated on metadata and data as two separate entities and has not provided specific support for joint real-time processing of metadata and sensor data. In this paper we propose a framework that allows effective sensor data and metadata management based on real-time metadata creation and join processing over federated sensor networks. The framework is established on three key mechanisms: (i) distributed metadata joins to allow streaming sensor data to be efficiently processed with their associated metadata, regardless of their location in the network, (ii) automated metadata generation to permit users to define monitoring conditions or operations for extracting and storing metadata from streaming sensor data, (iii) advanced metadata search utilizing various techniques specifically designed for sensor metadata querying and visualization. This framework is currently deployed and used as the backbone of a concrete application in environmental science and engineering, the Swiss Experiment, which runs a wide variety of measurements and experiments for environmental hazard forecasting and warning.
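The first two mechanisms (joining streaming tuples with metadata, and auto-generating metadata from monitoring conditions) can be sketched in miniature. All identifiers, schemas, and the threshold condition below are hypothetical; the paper's actual operators and distributed execution are not reproduced here.

```python
# Sketch: join each streaming sensor reading with its metadata, and
# derive new metadata (alerts) when a user-defined condition fires.
metadata = {                      # metadata store, keyed by sensor id
    "s1": {"site": "ridge", "unit": "degC"},
    "s2": {"site": "valley", "unit": "degC"},
}
alerts = []                       # auto-generated (derived) metadata

def on_reading(sensor_id, value, threshold=30.0):
    """Enrich one streaming tuple with its metadata; record an
    annotation when the monitoring condition holds."""
    meta = metadata.get(sensor_id, {})
    record = {"sensor": sensor_id, "value": value, **meta}
    if value > threshold:         # user-defined monitoring condition
        alerts.append(record)
    return record

stream = [("s1", 25.0), ("s2", 31.5), ("s1", 33.0)]
for sid, v in stream:
    on_reading(sid, v)
print(len(alerts), alerts[0]["site"])  # 2 valley
```

In the federated setting, the join and the metadata store are distributed across networks rather than co-located in one process, which is what the framework's distributed-join mechanism addresses.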